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ABSTRACT 

Rare-cleaving endonucleases have emerged as im- 
portant tools for making targeted genome modifica- 
tions. While multiple platforms are now available to 
generate reagents for research applications, each 
existing platform has significant limitations in one 
or more of three key properties necessary for thera- 
peutic application: efficiency of cleavage at the 
desired target site, specificity of cleavage (i.e. rate 
of cleavage at 'off-target' sites), and efficient/facile 
means for delivery to desired target cells. Here, we 
describe the development of a single-chain rare- 
cleaving nuclease architecture, which we designate 
'megaTAL', in which the DNA binding region of a 
transcription activator-like (TAL) effector is used to 
'address' a site-specific meganuclease adjacent to 
a single desired genomic target site. This architec- 
ture allows the generation of extremely active and 
hyper-specific compact nucleases that are compat- 
ible with all current viral and nonviral cell delivery 
methods. 



INTRODUCTION 

Targeted genome modifications can be acliieved using rare 
cleaving nucleases, such as zinc-finger nucleases (ZFNs), 
transcription activator-like effector nucleases (TALENs), 
meganucleases (mns, also termed homing endonucleases) 
and clustered, regularly interspaced, short palindromic 



repeat (CRISPR) RNA-guided nucleases (1-9). These 
reagents allow precise alterations to be introduced at 
their DNA recognition sites through cleavage of both 
DNA strands to yield double-strand breaks (DSBs) that 
are recognized and processed by the nonhomologous end 
joining (NHEJ) or homologous recombination (HR) 
DNA repair pathway (10-12). Repair by NHEJ involves 
rehgation of the DNA ends, which, after repeated 
cleavage, is often accompanied by mutagenesis via the 
introduction of small insertions or deletions (indels) at 
the site of the DSB that can result in the disruption of 
the coding sequence of a gene. In contrast, repair by HR 
results in seamless modification of the genome by copying 
an appropriate homologous DNA template at the site of 
the DNA break. 

The most demanding application of rare-cleaving nucle- 
ases is their use for therapeutic genome editing. For these 
applications, nuclease reagents must be delivered to a 
desired target cell in sufficient quantity to efficiently 
generate the desired DNA modification, while maintaining 
exquisite specificity to avoid compromising the genomic 
integrity of the cell. While the latest advances in each of 
the above-mentioned nuclease platforms enable quahty 
nuclease reagents that are suitable for nearly all research 
and nontherapeutic commercial applications to be 
generated for httle cost (13-20), several considerations 
still remain that limit their use for human cell therapies. 
These include a significant potential for off-target 
cleavage, particularly if expressed at high levels, as has 
been observed in the therapeutic application of ZFNs 
for disruption of the CCR5 gene in human T-cells 
(21-23). Additionally, since the Fokl cleavage domain 
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used in ZFNs and TALENs is activated via dimerization, 
ZFN and TALEN technologies require the construction 
and dehvery of two protein halves, each comprising a 
distinct DNA binding domain fused to Fokl, for a 
single target site. This architectural requirement limits 
the potential for ZFN and TALEN dehvery into 
primary cells using viral vectors as well as their simultan- 
eous (i.e. multiplexed) delivery of two or more nucleases to 
modify more than one gene (24,25). Although CRISPR 
systems have shown exciting promise in developing 
reagents for genome engineering with easily designed 
target recognition, they exhibit high off-target cleavage 
when used with single-guide RNAs, thus necessitating 
the use of nicking enzymes and dual guide RNAs per 
target to achieve specificity equivalent to that of the 
other major platforms (26-28). The consequent need to 
deliver both the Cas9 ORE plus two guides per target 
would appear to pose significant limitations for the use 
of CRISPR in therapeutic applications requiring viral vec- 
torization as well as for applications requiring multiplex- 
ing. Finally, although architecturally well suited for 
therapeutic applications, mns' tightly coupled cleavage 
and binding activity have limited their potential for re- 
design toward novel DNA target sites, with the majority 
of redesign efforts resulting in respecified enzymes with 
low target site affinity and consequent low overall 
activity (29,30). 

To address the limitations of present nuclease plat- 
forms, we have developed a hybrid nuclease architecture 
that combines the ease of engineerability of a TAL 
effector (TALE) with the cleavage sequence specificity of 
a mn cleavage domain. This 'megaTAL' architecture was 
achieved by fusing minimal TAL effector domains to the 
N -terminus of mns derived from the LAGLIDADG 
homing endonuclease family. 

MATERIALS AND METHODS 

megaTAL and mn construct generation 

MegaTALs were constructed using the Golden Gate 
assembly strategy previously described by Cermak et ai, 
using an repeat variable diresidue (RVD) plasmid library 
and destination vector generously provided by the Voytas 
lab (17). Plasmids were modified to allow the assembly of 
TAL effectors 1.5-10.5 RVDs long. The pthXOl destin- 
ation vector was modified to include a hemagglutinin 
(HA) tag immediately downstream of the nuclear 
localization signal (NLS) and to yield a NA154, C+63 
TALEN scaffold. TAL effectors were built using the fol- 
lowing RVDs to target each specific nucleotide: A — NI, 
C— HD, G— NN and T— NG. Following cloning of the 
TAL effector repeats into the destination vector, the 
protein linkers (Fok: SQLVKS, Zn4: VGGS, Flexl: 
GGSG or Flex2: GGSGGGSG) and I-Anil or I-Onul 
mn variants were cloned in place of the Fokl nuclease 
catalytic domain between the Xba-1 and Sal-I restriction 
sites. All megaTALs, except those used to compare hnker 
activity, were made using the Zn4 protein linker. The 
codon diverged 6.5C-WT megaTAL (Supplementary 
Figure S2d) was constructed by gene synthesizing a TAL 



effector repeat array (Invitrogen) that was then cloned 
between Age-I and Xho-I sites cloned into the Golden 
Gate destination vector by site-directed mutagenesis. 
Control constructs expressing mn variants were made by 
cloning the nuclease downstream of an NLS and HA tag 
between the Sbf-I and Sal-I restriction sites. AU con- 
structs, except those used for mRNA production, encode 
BFP-T2A-nuclease for tracking of nuclease expression 
during flow cytometry. An example of a full megaTAL 
protein sequence (L538-Zn4-Y2 I-Anil) as well as the 
sequence for each mn tested and the RVDs used 
for each synthetic TAL effector are provided in 
Supplementary Figure S6. 

Lentivirai production 

Lentivirus (LV) and integration-deficient LV (IDLV) were 
generated as previously described (31). Briefly, 9x10^ 
cells were transfected in 10-cm dishes with 6|.ig of the 
nuclease construct, 3 |.ig of psPAX2 viral packaging 
plasmid and 1.5 ng of either pMD2G (for LV) or integra- 
tion-deficient D64V pMD2G (for IDLV) viral envelope 
plasmid using PEI (Polysciences). Two days after transfec- 
tion, viral supernatants were collected and concentrated 
by centrifugation. Viral titers were determined from the 
population of blue fluorescent protein (BFP) expressing 
cells 72 h after transducing 293T cells with varying 
volumes of concentrated virus using polybrene. 

Cell line derivation 

HEK293T cell fines were generated harboring either the 
traffic hght reporter [TLR 2.1, (32)] or a modified traffic 
fight reporter (TLR) containing an near-infrared 
fluorescent protein (iRFP) gene in place of puromycin 
[epigenetic TLR, (33)]. Cells were derived as previously 
described with shght modifications. Briefly, HEK293T 
cells were transduced with recombinant LV to yield 5- 
10% transduction, based on either iRFP expression (epi- 
genetic TLR) or cell survival (TLR 2.1). Approximately 5 
days after transduction, cefls were sorted for iRFP+/ 
mCherry— populations (epigenetic TLR) or mCherry— 
populations (TLR 2.1). 

Cell sorting and flow cytometry 

Cells were analyzed by flow cytometry on the BD LSRII 
and sorted on the BD FACS ARIAII. Fluorophores were 
detected using the following lasers and filters: mCherry — 
excited 561 nm, acquired 610/20; niTagBFP — excited 
405 nm, acquired 450/50; cGEP— excited 488 nm, 
acquired 525/50; iRFP — excited 640 nm, acquired 730/ 
45. Data were analyzed using FlowJo software. 

TLR assay 

The TLR assay was performed as previously described 
with slight modifications. Cells harboring the TLR or epi- 
genetic TLR 2.1 were plated at 1-2.0 x 10^ ceUs/weU in a 
24-well dish 24 h before transfection or transduction. 
XtremeGene9 (Roche) was used at 2|il/|ig DNA to trans- 
feet cells with 0.5 [ig of both nuclease and green fluorescent 
protein (GFP) donor constructs. Transductions were 
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performed at an MOI of 5-10 using polybrene. Cells were 
harvested 72 h after transfection/transduction and read on 
the flow cytometer. Data were obtained from BFP+ (TLR 
2.1) or iRFP+/BFP+ populations (epigenetic TLR). 

Western blot analysis 

Reporter 293T cells were collected 72 h after treatment 
and lysed using radioimmunoprecipitation assay (RIPA) 
lysis buffer with protease inhibitors, and then lysates were 
run on a QiaShredder column (Qiagen). Protein concen- 
trations were determined using the BioRad protein assay 
reagent and 5-10 ng of protein was run on a 10% acryl- 
amide gel alongside Kaleidoscope protein ladder (BioRad) 
then transferred onto an Immobilo-FL membrane 
(Millipore) using semidry transfer. The membrane was 
blocked using Licor blocking buffer before staining with 
primary rabbit anti-HA (CeU Signaling Technology) and 
mouse anti-P-actin monoclonal antibodies and secondary 
PE-conjugated anti-mouse (Medial & Biological 
Laboratories) and GFP-conjugated anti-rabbit 
(Invitrogen) monoclonal antibodies. Membrane fluores- 
cence was then detected using the Licor Odyssey. 

Polymerase chain reaction analysis of TAL effector size 

Reporter 293T cells were collected 72 h after lentiviral 
transduction, and genomic DNA was extracted from 
these cells using the Qiagen Blood and Tissue Kit. 
Nested polymerase chain reaction (PGR) was performed 
using two sets of primers to ampHfy the TAL effector 
repeats from the integrated megaTAL ORFs as weU as 
the original megaTAL constructs for comparison. The 
PGR products from the LV-treated samples were cloned 
into the pJET vector (Glonejet) and transformed into 
DH5a cells. For each sample, 47 colonies were picked 
and subjected to colony PGR to ampHfy the pJET 
insert. PGR products were run on a 1.2''/o agarose gel 
for size determination. 

In vitro production of synthetic mRNA 

We used the T7-Scribe kit (GeUScript) for production of 
synthetic mRNA. Two different templates were utilized 
for in vitro transcription: a linearized plasmid (megaTAL 
and HE) or a PGR product containing a T7 promoter 
(Trex2), generated with the following primers: Trex2 
Forward, GGATGG TAATAGGAGTGAGTATAGGG G 
GGGGGAGGATGTGTGAGGGAGGTGGGGGTGAG; 
Trex2 reverse, jjjjjjjjjjtjjjjjtjjjtjjtjjj 

with T7 promoter underlined. The resultant mRNA was 
purified with the RNeasy miniprep columns (Qiagen) and 
a 5' cap was enzymatically added using the vaccinia 
capping system combined with a Gap 2'-0- 
Methyl transferase enzyme (both from NEB). A poly(A) 
taiUng kit (Life Technologies) was used to add a 
polyadenylation tail to 5'-capped mRNA and mRNA 
was purified using the RNeasy kit. The capped/ 
polyadenylated mRNA was eluted in sterile water and 
stored at -20°. 



Human T-cell culture and transfection 

We used the EasySEP Human T-cell enrichment kit 
(Stemcell technologies) to purify T-cells from human per- 
ipheral blood mononuclear ceUs obtained from anonym- 
ous donors at the Puget Sound Blood Genter. Isolated 
T-ceUs were stimulated with GD3/GD28 activation 
beads (Life Technologies) based on manufacturer's 
instructions. Activated T-cells were cultured in RPMl 
media supplemented with 10% fetal bovine serum, Ix 
GlutaMAX, 55|.iM P-mercaptoethanol, 100 U/ml penicil- 
lin, 100|ig/ml streptomycin, lOmM HEPES and lOng/ml 
IL-2 (Biolegend). The beads were magnetically removed 
after 36 h of incubation and the T-ceU resuspended in fresh 
culture media for 6-12 h before electroporation (48-56 h 
total culture time before electroporation). The Neon 
transfection system (Life Technologies) was used for elec- 
troporation, with the 10 ^1 electroporation kit used for aU 
transfections. T-cells were washed with PBS and resus- 
pended in Buffer T at 2 x 10^ cells/ml, with 2x10^ ceUs 
used per transfection. Approximately 1-1.5 (ig of each 
individual mRNA was added to each sample before elec- 
troporation. A maximum of 2.5 |ig of total mRNA was 
used for samples that had more than mRNA species 
(i.e., megaTAL with Trex2). Gells were electroporated 
with the following conditions: three pulses, 1400 V/ 
pulse, with 10 ms pulse width. After electroporation, 
cells were immediately dispersed into prewarmed, 
antibotic-free T-ceU media with 1L2 and cultured for 4-6 
days, with regular media changes, before analysis. 
Analysis of the T-cell receptor expression was performed 
using the aGD3-AlexaFluor 488 antibody (Biolegend, 
clone SK7). 

High-throughput sequencing and analysis 

High-throughput sequencing was performed on cells 
sorted for either BFP expression (293T ceUs) or which 
were GD3- (T-ceUs, except control whole population). 
PGR primers were designed to target ~ 150-200 bp on 
each side of the putative megaTAL cut site and append 
lUumina sequencing primer adaptors. Amplicons for aU 
samples including negative controls were then subjected 
to an additional round of amplification to append 
Hbrary barcodes as weU as outermost flowcell sequences. 
This setup allowed for library indexing at both the 
barcode and amplicon level. Final libraries were 
sequenced on 1.25 runs of a MiSeq (version 2) using 
paired end 250 bp read chemistry with a 9-bp index read. 
Reads for each de-multiplexed hbrary were aligned to 
their respective amplicon references using the Phaster 
read aligner (Phil Green, personal communication) 
owing to its superior ability to accurately ahgn reads 
with large indel events. Each ahgned read pair was then 
individually genotyped for the presence of indels. 
Statistical analysis for indel frequency and skewing 
(bias toward mn or TALE region) were performed using 
a permutation analysis (100000 iterations) and testing 
against the nuU hypothesis (negative controls, or lack of 
skew). 
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Statistical analysis 

Error bars on graphs represent s.e.m. f-values were 
calculated using Student's one-tailed unpaired t-test 
to compare activity of megaTALs with their specific mn 
counterpart (*/'<0.05 shown as, **/"< 0.005, 
***P< 0.0005). 



RESULTS 

Designing a TAL effector — mn fusion protein 
architecture: megaTAL 

Based on the recently proposed DNA binding mechanism 
of TAL effectors, in which binding is initiated by a series 
of degenerate TALE repeats that lack sequence specificity, 
followed by anneahng of the sequence-specific RVD array 
(34-36), we chose to evaluate a TALE-mn fusion architec- 
ture (schematized in Figure la), hereafter designated as 
the 'megaTAL' architecture. We hypothesized that this 
architecture would 'address' the mn, which has intrinsic 
specificity for a small set of target sites, to a single desired 
target, and thus substantially enhance cleavage at that site, 
without nonspecifically enhancing the activity at related 
'nonaddressed' targets. 

To evaluate the feasibihty of this approach, we 
generated an initial set of megaTALs consisting of a syn- 
thetic TAL effector [L538, (37)] fused through a short 
peptide linker to the mn 1-Anil [hereafter WT, (38)], as 
weU as a catalytically inactive 1-Anil mutant [hereafter 
E148D, (20)]. The nuclease activity of each megaTAL 
was tested in parallel with the corresponding stand-alone 
1-Anil mn using ceU hues containing TLRs with a series of 
target sites possessing varying spacers between the TAL 
effector and mn target sites (Supplementary Figure SI a). 
In these assays, the WT 1-Anil megaTAL exhibited up to 
18- and 24-fold increases in levels of mutagenic NHEJ 
(mutNHEJ) and gene targeting, respectively, relative to 
the stand-alone WT 1-AniI (Figure lb). Importantly, 
megaTAL activity was found to be spacer dependent, 
with the maximum increase in activity achieved against a 
target containing a 7-bp spacer (assuming a 20-bp mn 
target). No discernible differences in activity were 
observed between megaTALs constructed with four differ- 
ent protein Unkers (Supplementary Figure Sib). 

Effect of TALE array repeat number on megaTAL 
activity 

Because the Fokl catalytic domain (used in traditional 
TALENs) has low intrinsic affinity for DNA and lacks 
sequence specificity for DNA cleavage (2,5), both the 
affinity and specificity of a TALEN for an intended 
target site are entirely dependent on the DNA binding 
specificity of the TALE RVD arrays, necessitating RVD 
array lengths of >13 TALE repeats to avoid toxicity 
related to nonspecific binding (19). In contrast, because 
meganucleases have a high degree of both intrinsic 
affinity and specificity for a smaU group of target sites 
(6,7), we hypothesized that megaTALs could be con- 
structed with much shorter RVD array lengths. 
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Figure 1. DNA spacer requirement for megaTAL fusions, 
(a) Scliematic representation of a megaTAL. The megaTAL arctiitec- 
ture involves fusion of a TAL effector with truncated N- and 
C-terminal domains through a short hnker to the N-terminus of a 
mn (I-Anil, which displays a Kd of approximately 90 nM for its 
cognate target site). Aligned below the megaTAL schematic is the 
DNA sequence for the L538-I-Anil megaTAL target with a spacer 
length of 7 bp, the L538 DNA target underlined in green, the I-Anil 
DNA target underlined in blue and the DNA spacer separating the two 
outlined with a black box. (b) Increase in levels of mutagenic NHEJ 
(mutNHEJ) and gene targeting (GT) achieved by I-Anil megaTAL 
fusion. Results shown are derived from assays of cleavage activity 
using TLR reporter 293T cells treated 72 h with the L538-Zn4-WT 
megaTAL (mT) or WT I-Anil mn across targets with different spacer 
lengths or a 'scrambled' (scr) I-Anil target. Cleavage activity for L538- 
WT megaTAL for a given spacer length is normalized to the activity of 
the WT mn level (represented by a dashed line) across all cell lines. For 
all graphs, error bars indicate s.e.m and P-value comparing megaTALs 
with their appropriate mn counterpart is indicated by asterisk 
(*P<0.05. **/>< 0.005, ***/>< 0.0005). 



To evaluate the relationship between TALE RVD array 
length and megaTAL activity, we generated 1-AniI 
megaTALs with truncated versions of the L538 TAL 
effector ranging from 2.5 to 16.5 repeats long. These 
megaTALs were tested against their appropriate target, 
each with a 7-bp spacer (Supplementary Figure S2a), 
alongside the stand-alone WT I-Anil mn using the TLR 
reporter lines generated above (Figure 2a, Supplementary 
Figure S2b). All megaTALs were functional and none 
exhibited any detectable toxicity, and a megaTAL made 
with only 3.5 repeats showed a significant increase in levels 
of both gene targeting and mutNHEJ. Cleavage activity 
rose in close correlation with the number of TAL effector 
repeats, with megaTALs made by fusion of a 7.5 RVD 
TALE with the low affinity WT mn reaching near 
maximal activity. We observed an idiosyncratic loss of 
efficacy with the 6.5 RVD TALE, which subsequent 
analyses demonstrated could be attributed to the inter- 
action between the final RVD and its target base 
(SuppleiTientary Figure S2c). 

By reducing the number of TAL effector repeats present 
in a inegaTAL, we hypothesized that the DNA sequence 
encoding the TALE domain could be sufficiently diverged 
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Figure 2. RVD array requirement for megaTAL fusions, (a) Level of 
cleavage activity (assayed by mutNHEJ, shown in red, and gene tar- 
geting, shown in green) of megaTALs built with TAL effectors with 
different number of repeat units. Cleavage activity of TALE-Zn4-WT 
megaTALs built with a varying number of RVD array units was 
normalized to the activity of the WT I-Anil mn (represented by a 
dashed line) across all cell lines tested, (b) Level of cleavage activity 
(assayed by mutNHEJ) of megaTALs built with either a codon 
diverged or nondiverged 6.5 RVD TAL effector array delivered to 
reporter cells by DNA transfection or lentiviral delivery. Both the 
codon diverged (CD) and nondiverged (GG) megaTALs were able to 
rescue the activity of the WT mn using plasmid DNA (pDNA) trans- 
fection, lentiviral (LV) or integration-deficient lentiviral (IDLV) trans- 
duction. However, delivery using lentiviral vectors resulted in 
significantly lower activity in cells treated with the nondiverged 
version of the megaTAL. (c) Proportion of intact (full length) and 
recombined (short) megaTALs integrated into the genome of 293T 
cells. Either the codon diverged (CD) or nondiverged (GG) 6.5 RVD 
WT megaTAL were delivered to cells by lentiviral transduction and the 
viral integrants were assessed using clonal PCR amplification. 



SO as to reduce its repetitiveness and thus allow lentiviral 
packaging and delivery of the nuclease transgene without 
recombination of its open reading frame (ORF). We tested 
this by constructing a codon diverged megaTAL with a 6.5 
RVD array TAL effector (designated CD for 'codon 
diverged', sequence in Supplementary Figure S2d) and 
comparing its cleavage activity to a nondiverged 
(designated GG for 'golden gate') version using both 
lentiviral delivery and DNA transfection. A 1.6- and 2.1- 
fold difference in mutNHEJ levels was observed in 
reporter ceUs treated with an lentiviral or integration-de- 
ficient lentiviral vector carrying the CD or GG megaTAL 



version, suggesting a difference in either delivery or ex- 
pression efficiencies (Figure 2b). Western blot analysis of 
the megaTAL and mn treated cells revealed expression of 
the full-length product in diverged megaTAL treated cells, 
whereas those that received the nondiverged version ex- 
pressed protein of different sizes (Supplementary Figure 
2e). This observation was confirmed using clonal PCR 
analysis of the TAL effector RVD array from CD and 
GG megaTAL treated samples, which indicated that 97 
and 20% of the integrated ORFs from the respective cell 
populations encoded megaTAL proteins with intact 
TALE arrays (Figure 2c). 

Effect of megaTALs on mn target affinity and 
cleavage activity 

The most straightforward explanation for the increased 
activity observed with the I-Anil megaTAL compared 
with the stand-alone mn is that the TALE array effectively 
increases the local concentration of I-Anil by its adjacent 
cleavage site. We hypothesized that fusing TAL effector 
DNA binding domains to mns would thus generally act to 
potentiate the activity of mns toward targets for which 
they had low affinity, but not toward those for which 
they displayed slow cleavage kinetics. To test this hypoth- 
esis, we further evaluated the behaviors of (i) a series of 
megaTALs generated from mns with varying affinities for 
the LAnil native target, and (ii) a single megaTAL at 
target sites for which the mn exhibits differing cleavage 
kinetics. 

We generated megaTALs using two additional I-Anil 
variants (hereafter F13Y and Y2) with shghtly increased 
affinities for the native 1-AniI target site (Kd = 90 nM, 
for LAnil WT; Kd = 30nM, for I-Anil F13Y; 
Kd = 8 nM, for I-Anil Y2) (38). MegaTALs made with 
the moderate (F13Y) and high (Y2) affinity mn variants 
also showed significant increases in activity (Figure 3a, 
Supplementary Figure Sib and c), albeit to a lesser 
degree (< 13-fold increase for F13Y, <4-fold for Y2) 
than observed with the original I-Anil megaTAL. Thus, 
the effect of TALE array addressing correlates with the 
intrinsic affinity of the mn variant: low affinity variants 
are boosted to a much greater extent than those with in- 
trinsically higher affinity, consistent with the idea that the 
TAL effector domain influences nuclease activity by 
increasing the local concentration of the mn cleavage 
domain at its addressed target site. Additionally, the 
moderate affinity mn F13Y reached a plateau of activity 
with a shorter (4.5 RVD) TALE array relative to the 
TALE array length needed (5.5-7.5 RVD) to maximize 
the activity of the lower affinity WT 1-Anil mn 
(Supplementary Figure S2b). 

We then further defined the TALE 'addressing' effect by 
testing the activity of the high affinity Y2 mn and L538-Y2 
megaTAL against a series of DNA target variants that 
harbor single base-pair substitutions (relative to the 
native l-Anil target), for which the kcat* and Km* of the 
Y2 variant have previously been determined (Figure 3b) 
(39). Nonnative targets were chosen for which Y2 displays 
either 'high Km*' (Y2 exhibits a siinilar kcat* but reduced 
affinity relative to the original target site) or 'low kcat*' 
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Figure 3. megaTAL addressing acts by increasing the effective mn concentration at the addressed target site, (a) Level of cleavage activity (assayed 
by mutNHEJ) of megaTALs built with mns with varying affinities in TLR cell lines containing targets with different DNA spacer lengths. The level 
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(Y2 exhibits a reduced kcat* but comparable binding 
affinity compared with the original target site). Cleavage 
activity was measured using TLR reporter cells with 
targets composed of the L538 binding site and the indi- 
vidual high Km*, low k^at* or wt I-Anil targets, separated 
by an 8-bp spacer. As expected based on its known 



biochemical properties, the stand-alone Y2mn showed 
low or background levels of activity against all six nonna- 
tive targets. In contrast, cleavage activity was considerably 
increased (12-38-fold for HR, 8-27-fold for mutNHEJ) in 
all high Km* targets treated with their corresponding 'ad- 
dressed' megaTAL, while little or no effect was observed 



Nucleic Acids Research, 2014, Vol. 42, No. 4 2597 



for the low k^at* targets (0-3-fold for GT and mutNHEJ) 
(Figure 3b and d). However, megaTAL activity against 
the low kcat* target -IG was found to be significantly 
increased compared with mn treatment, suggesting that 
in some cases 'addressing' may partially compensate for 
reduced catalytic activity. 

Increasing on-target cleavage with megaTALs 

One of the major shortcomings of current nuclease plat- 
forms is their potential for off-target cleavage and the 
negative effects that this may have on the efficacy or 
safety profile of a putative nuclease-based therapeutic ap- 
plication. To evaluate the specificity properties of the 
megaTAL architecture, we tested whether a megaTAL 
could consistently distinguish between 'addressed' and 
'nonaddressed' genomic target sites, aU cleaved by a 
common mn cleavage domain. Near-native sites of the 
I-Anil target (DNA sequences containing three or fewer 
base-pair differences from the native I-Anil recognition 
sequence) were previously identified in the human 
genome using the UCSC BLAT tool (40). The cleavage 
activity of the Y2 LAnil variant mn was assessed at these 
near-natives targets using both episomal and endogenous 
targets (data not shown) and the five sites that displayed 
the highest levels of cleavage by Y2 in those assays were 
chosen for further examination (Figure 4a). Synthetic 
TAL effectors specific to regions upstream of the +9T 
and +5A+8T near-native targets were built and fused to 
the WT, F13Y and Y2 I-Anil mns. These megaTALs were 
tested alongside the stand-alone mn variants in TLR 
reporter cell lines for each of the five targets. The +9T 
and +5A+8T megaTALs showed increased levels of gene 
targeting at their addressed targets only, while having no 
observed effect on the activity at the remaining sites 
(Figure 4b). Notably, while the addition of a sequence- 
specific TAL effector domain to the high-affinity Y2mn 
did not reduce its cleavage at nonaddressed targets (the 
Y2mn and megaTALs show similar levels of GT at off- 
target sites), treatment with megaTALs made with the 
lower affinity WT and F13Y mns resulted in levels of 
gene targeting at addressed targets equivalent to that 
achieved with the Y2 megaTALs, but without detectable 
off-target activity. 

To verify that a similar effect would be observed at 
the endogenous near-native loci, 293T cells were treated 
with +9T megaTALs, +5A+8T megaTALs and mn 
variants, and the level of mutNHEJ was determined at 
each near-native site by high-throughput sequencing 
(Figure 4c, Supplementary Figure S3a and b). These 
results mirrored those from the reporter assay, with the 
WT and F13Y megaTALs achieving indel rates up to 
75% at their addressed sites, while maintaining near- 
background activity at nonaddressed sites. 

Highly effective knockout of the T-cell receptor alpha 
gene in primary T-cells with a megaTAL 

To verify that the benefits observed when fusing wild-type 
mns to a TAL effector would be achievable with an 
engineered mn and could be captured in a primary cell 
type in a relevant therapeutic application, we designed a 



variant of the I-Onul mn to knockout the T-cell receptor 
alpha (TCRa) gene. Knockout of the TCRa gene in 
human T-cells has been proposed as an approach to 
improve the safety of T-cells endowed with chimeric 
antigen receptors or heterologous TCRs, by preventing 
them from generating graft-versus-host disease (41). 
Thus, knockout of TCRa in human T-cells is among the 
most translationally relevant applications of rare cleaving 
nuclease technology. 

In vitro characterization of the TCRa mn showed that 
its target site affinity and cleavage activity were similar to 
that of its parent enzyme. Specificity proflhng of the 
TCRa mn and subsequent high-throughput in vitro 
cleavage analysis led to the identification of a number 
of potential off-target sites, which were cleaved with com- 
parable efficiency to the desired target (Supplementary 
Figure S4a and e). To determine if generation of a 
TCRa megaTAL would yield improvements in activity 
and specificity, the TCRa mn was fused to a 10.5 repeat 
TALE array designed to bind a DNA sequence upstream 
of the TCRa mn binding site (Figure 5a). T-ceUs were then 
transfected with mRNA encoding either the TCRa 
megaTAL or the stand-alone mn. In order to increase 
the efficiency of gene knockout, samples were included 
wherein either the mn or the megaTAL were co-trans- 
fected with niRNA encoding Trex2, a 3' to 5' exonuclease 
that has been shown to significantly increase rates of 
mutNHEJ at mn cleavage sites (42,43). Disruption of 
the TCRa gene was measured by staining for CD3, a 
multi-protein complex that is associated with the a/P T- 
cell receptor. Strikingly, while disruption of the TCRa 
gene occurred at minimal rates in cells treated with the 
stand-alone mn (1.6%), the TCRa megaTAL exhibited a 
~20-fold increase in activity at the addressed target site in 
the TCRa gene (Figure 5b, representative flow plots are 
shown in Figure 5c). Co-transfection of each nuclease with 
Trex2 further increased the rate of disruption, yielding 
rates of TCRa gene disruption consistently exceeding 
70%. 

Both on- and off-target cleavage rates were analyzed by 
sequencing of the TCRa locus and putative off-target sites 
(identified based on in vitro cleavage profiling), in FACS 
sorted CD3- populations of nuclease/Trex2 treated ceUs 
(Figure 5d, Supplementary Figure S5a-c). For each treat- 
ment, modification rates at the TCRa locus were expected 
to be >50%, signifying knockout of the gene at the 
expressing allele in conjunction with some additional 
rate of modification of the nonexpressing allele. In 
megaTAL treated cells, we observed 70.4% modification, 
suggesting a substantial rate (c.a. 1 in 5 cells) of modifica- 
tion of the nonexpressed allele. In contrast, when we 
expressed the megaTAL with Trex2, we observed that 
only 52.0%) of the alleles carried indels, and even lower 
rates were obtained from the mn-alone treated cells. We 
attribute the differences in modification rates observed in 
the presence and absence of Trex2 to lowered expression 
of the nuclease in cells co-transfected with the exonucle- 
ase. The lower modification rates observed in the low 
abundance CD3- mn alone treated cells are attributable 
to contamination of the sorted populations with a larger 
proportion of background, unstained cells. The 
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Figure 4. megaTAL addressing selectively increases on-target cleavage, (a) I-Anil human genomic near-native sequences and their location in the 
genome. Synthetic TAL effectors were built against the +9T and +5A+8T sequences underlined in green and the I-Anil near-natives sequences are 
underlined in blue, (b) Level of gene targeting in TLR cells with endogenous human I-Anil near-native targets using 'addressed' +9T and +5A+8T 
megaTALs and 'unaddressed' I-Anil mns. mutNHEJ was not measured via TLR due to the presence of a stop codon in some target sites that 
prevented flow cytometry readout of these events, (c) Mutation rates at endogenous targets using 'addressed' I-Anil megaTALs and 'unadressed' 
mns. The level of mutNHEJ at the endogenous I-Anil near-native targets in 293T cells after 72 h of treatment with mn and megaTAL variants was 
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sequencing analysis also demonstrated that off-target 
cleavage in cells treated with the megaTAL alone is ex- 
tremely low, with only one of 22 potential sites exhibiting 
a rate of cleavage above background in the presence of 
Trex2 (OT23). Moreover, the megaTAL produced lower 
levels of mutNHEJ at several putative off-target loci than 
the stand-alone inn (i.e. OT18) — this result may reflect 
statistical variation in the deep sequencing results, or al- 
ternatively could be accounted for by reduced expression 
of the megaTAL nuclease compared with its shorter mn 
counterpart. 

DISCUSSION 

The megaTAL nuclease architecture described here 
addresses several key limitations of existing nuclease 



platforms for therapeutic applications, as it enables the 
generation of compact single chain nucleases that are ex- 
tremely active and specific. 

MegaTALs exhibit dramatic increases in cleavage 
activity compared with their isolated mn cleavage heads, 
an effect that appears to be an outcome of the auxiliary 
TAL effector DNA-binding domain docking ('address- 
ing') the mn adjacent to its target and effectively 
increasing the rate of enzyme-substrate complex forma- 
tion. Furthermore, the mn-mediated cleavage produces 
DNA ends with 3' overhangs that are efficiently processed 
by the Trex2 exonuclease, thus allowing the co-expression 
of a megaTAL and Trex2 to achieve unprecedentedly high 
targeted modification rates in primary ceUs of >70%. 

MegaTALs offer an extreme level of target specificity 
due to the combined DNA binding and cleavage 
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Figure 5. megaTAL targeting TCRoi achieves extremely high gene disruption with no detectable off-target cleavage in human primary T-cells. 
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the TCRa megaTAL alone (middle). Values calculated as <0 are given as '0'. Sequencing results were not obtained for putative off-targets OT12 and 
OT19 and are therefore not shown on the map; other samples for which sequencing results were not obtained are indicated as 'NA'. 



properties of the protein architecture. The activity of 
megaTALs is reliant on recognition of the tandem DNA 
targets of the TALE array and the mn, respectively, ef- 
fectively extending the length of the target site and 
reducing the hkelihood that more than one target site 
would be present in the human genome. Similarly, as 



low affinity mn cleavage heads are not able to efficiently 
access target sites to which they are not 'addressed' by the 
TALE array, the potential for cleavage at nonaddressed 
mn recognition sites can be markedly reduced. This is re- 
flected in the megaTAL used for TCRa gene knockout in 
primary human T-cells, which achieved 70% knockout 
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while above background off-target cleavage was limited to 
a single site (e.g. Figure 5d, and Supplementary Figure 5). 
As the single off-target cleavage site identified is in a 
noncoding region of the genome, this enzyme is of suffi- 
cient quahty at its present state of development for clinical 
use. However, a further advantage of the megaTAL archi- 
tecture is the potential to further 'tune' the mn cleavage 
head to specifically avoid off-target cleavage sites. 
Thus, we anticipate that iterative refinement of 
megaTALs cleavage heads will allow the eventual identi- 
fication of megaTALs that couple high efficiency with 
perfect on target genomic specificity. 

The megaTAL architecture also possesses several 
properties that facilitate efficient dehvery for therapeutic 
applications. MegaTALs can be expressed with a single 
~2kb ORF, smaller than two ZFN halves [2.2 kb total 
(1.1 kb per ZFN half targeting a 9-bp target)] and signifi- 
cantly shorter than a single TALEN half [>5.0kb total 
(2.5 kb per TALEN half targeting a 13-bp target)] or 
Cas9 nuclease (4.1 kb, not including guide RNA). Thus, 
megaTALs are compatible with efficient vectorization as a 
DNA, niRNA or with any widely used viral vector system. 
The capacity to use a TAL effector domain with as few as 
5.5 repeats also facihtates codon divergence of the TAL 
effector repeats to allow packaging into retroviral vectors, 
where recombination among TAL effector RVDs during 
reverse transcription can be problematic (25). FinaUy, the 
independent function of megaTALs due to their single 
chain design avoids the possibility of inappropriate 
heterodimerization that may occur with enzymes based 
on dimerization-dependent cleavage domains, such as 
Fokl, an important advantage for applications requiring 
multiplex genetic modifications. 
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